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Abstract 

We revisit the complexity of online computation in the cell probe model. We consider 
a class of problems where we are first given a fixed pattern or vector C of n symbols and 
then one symbol arrives at a time in a stream. After each symbol has arrived we must 
output some function of F and the n-length suffix of the arriving stream. Cell probe 
bounds of e((51gn/w) have previously been shown for both convolution and Hamming 
distance in this setting, where 5 is the size of a symbol in bits and w € O(lgn) is the 
cell size in bits. However, when (5 is a constant, as it is in many natural situations, 
these previous results no longer give us non-trivial bounds. 

We introduce a new lop-sided information transfer proof technique which enables us 
to prove meaningful lower bounds even for constant size input alphabets. We use our 
new framework to prove an amortised cell probe lower bound of Bflg^ n/{w ■ Iglgn)) 
time per arriving bit for an online version of a well studied problem known as pattern 
matching with address errors. This is the first non-trivial cell probe lower bound for 
any online problem on bit streams that still holds when the cell sizes are large. We 
also show the same bound for online convolution conditioned on a new combinatorial 
conjecture related to Toeplitz matrices. 


1 Introduction 

We revisit the complexity of online computation in the cell probe model. In recent years 
there has been considerable progress towards the challenging goal of establishing lower 
bounds for both static and dynamic data structure problems. A third class of data struc¬ 
ture problems, one which falls somewhere between these two classic settings, is online 
computation in a streaming setting. Here one symbol arrives at a time and a new output 
must be given after each symbol arrives and before the next symbol is processed. The key 
conceptual difference to a standard dynamic data structure problem is that although each 
arriving symbol can be regarded as a new update operation, there is only one type of query 
which is to output the latest value of some function of the stream. 
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Online pattern matching is particularly suited to study in this setting and cell probe 
lower bounds have previously been shown for different measures of distance including 
Hamming distance, inner product/convolution and edit distance [5l|6lll]. All these previous 
cell probe lower bounds have relied on only one proof technique, the so-called information 
transfer technique of Patra§cu and Demaine [15]. In loose terms the basic idea is as follows. 
First one defines a random input distribution over updates. Here we regard an arriving 
symbol as an update and after each update we perform one query which simply returns the 
latest distance between a predefined pattern and the updated suffix of the stream. Then 
one has to argue that knowledge of the answers to ^ consecutive queries is sufficient to 
infer at least a constant fraction of the information encoded by ^ consecutive updates that 
occurred in the past. If one can show this is true for all power of two lengths (. and ensure 
there is no double counting then the resulting lower bound follows by summing over all 
these power of two lengths. 

For the most natural cell size w G Il(lgn), a cell probe lower bound of ^{5\gn/w) for 
both Hamming distance and convolution using this method was shown, where 5 is the size 
of an input symbol, w is the cell size in bits and n is the length of the fixed pattern 13 El- 
When 6 = w, there is also a matching upper bound in the cell probe model and so no further 
progress is possible. However, when the symbol size 6 is in fact constant, the best lower 
bound that is derivable reduces trivially to be constant. This is a particularly unfortunate 
situation as arguably the most natural setting of parameters is when the input alphabet is 
of constant size but the cell size is not. 

To make matters worse, this limitation is neither specihc to pattern matching problems 
nor even to online problems in general. As it is a fundamental feature of the information 
transfer technique the requirement to have large input alphabets also applies to a wide of 
dynamic data structure problems for which the information transfer technique has been 
up to this point the lower bound method of choice. As a result we see the challenge of 
providing a new proof technique which can meaningfully handle constant sized alphabets as 
fundamental to the aim of advancing our knowledge of the true complexity of both online 
and dynamic problems. 

We introduce a new lop-sided version of the information transfer technique that enables 
us to give meaningful lower bounds for precisely this setting, that is when 5 G 0(1) and 
w G n(lgn). Our proof technique will rely on being able to show for specific problems 
that we need only i query answers to give at least a constant fraction of the information 
encoded in £lg£ updates. 

We demonstrate our new framework by first by applying it to the well studied online 
convolution problem. For this problem we give a conditional cell probe lower bound which 
depends on a new combinatorial conjecture involving Toeplitz matrices. We then show that 
it is possible to derive an identical but this time unconditional lower bound by applying the 
lop-sided information transfer technique to a problem called online pattern matching with 
address errors [2j. This measure of distance arises in pattern matching problems where 
errors occur not in the content of the data but in the addresses for where the data is stored. 
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Previous cell probe lower bounds 

Our bounds hold in a particularly strong computational model, the cell-probe model, intro¬ 
duced originally by Minsky and Papert jl2| in a different context and then subsequently 
by Fredman [8] and Yao m- In this model, there is a separation between the computing 
unit and the memory, which is external and consists of a set of cells of w bits each. The 
computing unit cannot remember any information between operations. Computation is 
free and the cost is measured only in the number of cell reads or writes (cell-probes). This 
general view makes the model very strong, subsuming for instance the popular word-RAM 
model. 

The first techniques known for establishing dynamic data structure lower bounds had 
historically been based on the chronogram technique of Fredman and Saks j9] which can 
at best give bounds of 0(lgn/lglgn). In 2004, Patra§cu and Demaine gave us the first 
n(lgn) lower bounds for dynamic data structure problems |15) . Their technique is based 
on information theoretic arguments which also form the basis for the work we present in 
this paper. Patra§cu and Demaine also presented ideas which allowed them to express 
more refined lower bounds such as trade-offs between updates and queries of dynamic data 
structures. For a list of data structure problems and their lower bounds using these and 
related techniques, see for example m- More recently, a further breakthrough was made 
by Larsen who showed lower bounds of roughly D((lgn/lglgn)^) time per operation for 
dynamic weighted range counting problem and polynomial evaluation mm- These lower 
bounds remain the state of the art for any dynamic structure problem to this day. 

1.1 Our Results 

The lop-sided information transfer technique In the standard formulation of the 
information transfer technique of Demaine and Patra§cu |14j . adjacent time intervals are 
considered and the information that is transferred from the operations in one interval of 
power of two length i to the next interval of the same length is studied. It is sufficient to 
prove that a lower bound can be given for the information transfer which applies to all 
consecutive intervals of power of two length. Conceptually a balanced tree on n leaves over 
the time axis is constructed which is known as the information transfer tree. An internal 
node V is associated with the times Iq, ti and t2 such that the two intervals s-nd 

[ti -|- l,t 2 ] span the left subtree and the right subtree of v, respectively. By summing over 
the information transfer at each node v the final lower bound is derived. 

In order to show a cell probe lower bound for bit streams, we will need to give lower 
bounds for the information transferred from intervals of time [to, ti] to later intervals [t 2 , ta] 
which are shorter than the first intervals. In particular we will want to argue about intervals 
of length ilgi and i respectively. However, making the interval lengths lop-sided requires 
us to abandon the information transfer tree and indeed almost all of the previous proof 
technique. We will instead place gaps in time between the end of one interval and the start 
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of another and argue carefully both that not too much of the information can be lost in 
these gaps and that we can still sum the information transfer over a sufficient number of 
distinct interval lengths without too much double counting. 

Our hope is that this new technique will lead to a new class of cell probe lower bounds 
which could not be proved with existing methods. 


Online convolution In the online convolution problem we are given a hxed vector 
F G {0,1}” and a stream that arrives one bit at a time. After each bit arrives we must 
output the inner product between F and a vector formed from the most recent n-length 
suffix of the stream. At the heart of our lower bound we need to show that we can recover 
n(£lg£) bits of information of a contiguous subarray of the stream of length £lg£. However, 
because the inputs are binary and to avoid a trivial lower bound, we must do so using only 
£ outputs. As we need to achieve this for a large number distinct values of i simultaneously, 
the hxed vector F we design for the online convolution problem has a carefully designed 
recursive structure. However even having overcome this hurdle, there is still the challenge 
of showing that for each separate value £, the information from successive outputs does not 
have too large an overlap. A complete resolution to this question seems non-trivial but in 
Section]^ we present a new combinatorial conjecture related to Toeplitz matrices, which if 
true, provides the desired lower bound. As a result we get the following lower bound for 
online convolution in a bit stream: 


Theorem 1 (Online convolution). Assuming Conjecture^ in the cell-probe model with 
w G O(lgre) bits per cell, for any randomised algorithm solving the online convolution 
problem on binary inputs there exist instances such that the expected amortised time per 
arriving value is 




w -IglgnJ 


Online pattern matching with address errors (L 2 -rearrangement distance) As 

our second example we give an explicit distance function for which we can now obtain the 
hrst unconditional online lower bound for symbol size 5 = 1. We consider a problem setting 
known as pattern matching with address errors and in particular the L 2 -rearrangement 
distance as dehned in [2j. Consider two strings two strings Si and S 2 both of length n 
where S 2 is a permutation of Si. Now consider the set of permutations H so that for 
all TT G H, Si[7r(0),... ,vr(n — 1)] = S 2 . The L 2 -rearrangement distance is defined to be 
min^gn \d ~ ^ is empty, that is S 2 is in fact not a permutation of Si, 

then the L 2 -rearrangement distance is dehned to be 00. When considered as an offline 
problem, for a text of length 2n and a pattern of length n, Amir et al. showed that the 
L 2 -rearrangement distance between the pattern and every n-length substring of the text 
could be computed in O(nlgn) time |2]. In the online L 2 -rearrangement problem we are 
given a hxed pattern F G {0,1}” and the stream arrives one bit at a time. After each 
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bit arrives we must output the L 2 -rearrangement distance between F and the most recent 
n-length suffix of the stream. 

As before we need to recover bits of information of a contiguous sub-array 

of the stream of length ^Ig.^. Our technique allows us to recover O(lgre) distinct bits of 
stream from each output. This is achieved by constructing F and carefully choosing a 
highly structured random input distribution for the incoming stream in such a way that the 
contributions to the output from different regions of the stream have different magnitudes. 
We can then use the result to extract distinct information about the stream from different 
parts of each output. 

Using this approach we get the following cell probe lower bound: 


Theorem 2 (Online L 2 -rearrangement). In the cell-probe model with w G O(lgn) bits 
per cell, for any randomised algorithm solving the online L 2 -rearrangement distance problem 
on binary inputs there exist instances such that the expected amortised time per arriving 
value is 




w -IglgnJ 


2 Lop-sided information transfer 

We will define the concept of information transfer, a particular set of cells probed by the 
algorithm, and explain how a bound on the size of the information transfer can be used 
when proving the overall lower bounds of Theorems and All logarithms are in base 
two. 


2.1 Notation for the online problems 

The define some notation for our online problems. There is a fixed array F G {0,1}" of 
length n and an array S G {0,1}"' of length n, referred to as the stream. We maintain S 
subject to an update operation update(x) which takes a value x G {0,1}, modifies S by 
appending x to the right of the rightmost component S[n — 1] and removing the leftmost 
component ^[O], and then outputs the inner product of F and S, that is X]ie[n] '“S’!*]), or 
alternatively the L 2 -rearrangement distance, depending on which problem we are currently 
considering. 

We let U G {0, !}"■ denote the update array which describes a sequence of n update 
operations. That is, for each t G [n], the operation UPDATE(17[t]) is performed. We will 
usually refer to t as the arrival of the value U[t]. Observe that just after arrival t, the 
values U[t-\- 1, n — 1] are still not known to the algorithm. Finally, we let the n-length array 
A denote the outputs such that for t G [n], A[t] is the output of UPDATE(C/[t]). 
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2.2 Hard distributions 

Our lower bound holds for any randomised algorithm on its worst case input. This will 
be achieved by applying Yao’s minimax principle M- We develop a lower bonnd that 
holds for any deterministic algorithm on some random input. The basic approach is as 
follows: we devise a fixed array F and describe a probability distribution for U, the n new 
values arriving in the stream S. We then obtain a lower bound on the expected rnnning 
time over the arrivals in U that holds for any deterministic algorithm. Due to the minimax 
principle, the same lower bound must then also hold for any randomised algorithm on its 
own worst case input. The amortised bound per arriving value is obtained by dividing by 
n. 

From this point onwards we consider an arbitrary deterministic algorithm running with 
some fixed array T on a random input of n values. The algorithm may depend on F. We 
refer to the choice of F and distribution for 17 as a hard distribution since it nsed to show 
a lower bound. 

2.3 Two intervals and a gap 

In order to define the concept of information transfer from one interval of arriving valnes 
in the stream to another interval of arriving values, we hrst define the set L which contains 
the interval lengths that we will consider. The purpose of the next few definitions will be 
clear once we define information transfer in the next section. We let 



To avoid cluttering the presentation with floors and ceilings, we assume throughout that 
the value of n is such that any division or power nicely yields an integer. Whenever it 
is impossible to obtain an integer we assume that suitable floors or ceilings are used. In 
particular, L contains only integers. 

For £ € L and t £ [n/2] we define the following fonr valnes: 

to = t, 

ti = to + £lg£-l, 

^2 — - 1 " 1 ; 

Ign 

t3 = t2+£-l. 

The valnes to, ti, ^2 and t^ are indeed functions of £ and t bnt for brevity we will often 
write just to instead of to{£,t), and so on, whenever the parameters £ and t are obvious 
from context. The four values dehne the three intervals [to,ti], [ti + 1,^2 — 1] and [t 2 ,t 3 ], 
referred to as the first interval, the gap and the second interval, respectively. Whenever 
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there is a risk of ambiguity of which parameters the intervals are based on, we may specify 
what (. and t they are associated with. Before we explain the purpose of the intervals we 
will highlight some of their properties. 

First observe that the intervals are disjoint and the gap has length M/\gn. The first 
interval has length ^\g^ and starts at t, where t is always in the first half of the interval 
[0, n — 1]. The second interval has length i, hence is a log-factor shorter than the first 
interval. All intervals are contained in [0, n — 1]. To see this we need to verify that ^ n — 1 
for any choice of i G L and t G [f^/2]. The largest value in L is 

. (Igi7,)4igign = 

Thus, the largest possible value of ts, obtained with i = and t = n/2 — 1, is 

^ ^-b 1 + + 5) < n, 

whenever n is sufficiently large. 

Lastly, suppose that T G L is one size larger than £ G L, that is £' = £ ■ (Ign)^. For £' 
the length of the gap is 4T/lgn, which so big that it spans the length of both intervals 
plus the gap associated with 1. To see this, observe that for sufficiently large n, 

t) — f = ^ Ig^ — 1 + -+ l 

Ign 

^ 2£lgn + -- ^ 2£lgn+ ^ ^ = 3^1gn ^ -—. 

Ig n Ig n Ig n 

2.4 Information transfer over gaps 

Towards the definition of information transfer, we define, for i G L and t G [u/2], the 
subarray Ui^t = U[to, • • • Ti] to represent the £lg£ values arriving in the stream during the 
first interval. We define the subarray = A[t 2 , • • •, ta] to represent the £ outputs during 
the second interval. Lastly we define to be the concatenation of 17[0, (to ~ 1)] and 
?7[(ti + 1), (n — 1)]. That is, Ui^t contains all values of U except for those in U^^f 

For £ G L and t G [n/2] we first define the information transfer to the gap, denoted 
to be the set of memory cells c such that c is probed during the first interval [to,ti] 
of arriving values and also probed during the arrivals of the values U[ti + f,t 2 — f] in the 
gap. Similarly we define the information transfer to the second interval, or simply the 
information transfer, denoted to be the set of memory cells c such that c is probed 
during the first interval [toTi] of arriving symbols and also probed during the arrivals of 
symbols in the second interval [^ 2 X 3 ] but not in the gap. That is, any cell c G Gt,t cannot 
also be contained in the information transfer 
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The cells in the information transfer ^ may contain information about the values 
in that the algorithm uses in order to correctly produce the outputs However, 
since cells that are probed in the gap are not included in the information transfer, the 
information transfer might not contain all the information about the values in Ui^t that 
the algorithm uses while outputting In all previous work on lower bounds where 

the information transfer technique is used, the two intervals had no gap between them, 
hence the information transfer contained all the information about the updates in 
necessary for computing Further, in all previous work, the two intervals always had 
the same length. We will see that the introduction of a gap and skewed interval lengths 
enable us to provide non-trivial lower bounds for small inputs with large cell sizes. We will 
see that the gap is small enough that a sufficiently large portion of the information about 
has to be fetched from cells in the information transfer X^ j. 

Since cells in the information transfer are by dehnition probed at some point by the 
algorithm, we can use to measure, or at least lower bound, the number of cell probes. 
As a shorthand we let = \^e,t\ denote the size of the information transfer Similarly 
we let denote the size of the information transfer to the gap. By adding up 

the sizes of the information transfers over all £ G L and certain values of t G [n/2], 
we get a lower bound on the total number of cells probed by the algorithm during the n 
arriving values in U. The choice of the values t is crucial as we do not want to over-count 
the number of cell probes. In the next two lemmas we will deal with the potential danger 
of over-counting. 

For a cell c G X^^t, we write the probe of c with respect to X^^t to refer to the hrst probe 
of c during the arrivals in the second interval. These are the probes of the cells in the 
information transfer that we count. 

Lemma 1. For any £ £ L and t,t' £ [^/2] such that \t — t'\ '^ £, if a cell c is in both Xg^t 
and Xf^ti then the probe of c with respect to X^^t and the probe of c with respect are 
distinct. 

Proof. Since t and t' are at least £ apart, the second intervals associated with t and t', 
respectively, must be disjoint. Hence the probe of c with respect to X^^t and the probe of c 
with respect X^^t' must be distinct. □ 

From the previous lemma we know that there is no risk of over-counting cell probes of 
a cell over information transfers X^ ^ under a hxed value of £ G X, as long as no two values 
of t are closer than £. In the next lemma we consider information transfers under different 
values oi £ £ L. 

Lemma 2. For any £,£' £ L such that £ 7 ^ £', and any t,t' £ [’^/2], if a cell c is in both 
X/i^t and Xp^t' then the probe of c with respect to Xi^t and the probe of c with respect X^^i^t' 
must be distinct. 



Proof. Let p be the probe of c with respect to and let p' be the probe of c with respect 
We will show that p p'■ Suppose without loss of generality that i < P. From the 
properties of the intervals that were given in the previous section we know that the length 
of the gap associated with P is larger than the sum of lengths of the first interval, the gap 
and the second interval associated with 

Suppose for contradiction that p = p'. By definition of X^ the cell c is probed also in 
the first interval associated with £. Let pfirst denote any such cell probe. Because the gap 
associated with P is so large, Pfirst must take place either in the second interval or the gap 
associated with P . If pgi-st is in the gap, then c cannot be in X^/^*/. If pfirst is in the second 
interval then p' cannot equal p. □ 

In order to lower bound the total number of cell probes performed by the algorithm 
over the n arrivals in U we will define, for each £ G L, a set Ti C [n/2] of arrivals, such that 
for any distinct t,P G Tg, \t — t'\ ^ i. It then follows from Lemmasandthat 

i£L t&Ti 

is a lower bound on the number of cell probes. Our goal is to lower bound the expected 
value of this double-sum. The exact definition of will be given in Section |3.3| once we 
have introduced relevant notation. 

3 Proving the lower bound 

In this section we give the overall proof for the lower bounds of Theorems and Let 
i G L and let t G [n/2]. Suppose that Ui^t is fixed but the values in U(,^t are drawn at 
random in accordance with the distribution for U, conditioned on the fixed value of Ui^f 
This induces a distribution for the outputs We want to show that if the entropy of 
Ai^t is large, conditioned on the fixed then the information transfer is large, since 
only the variation in the inputs can alter the outputs Ai^f We will soon make this 
claim more precise. 

3.1 Upper bound on the entropy 

We write H{Ai^t \ Ui^t = u^^t) to denote the entropy of Ai^t conditioned on fixed Ui^f To¬ 
wards showing that high conditional entropy H{A^^t \ Ul,t = implies large information 
transfer we use the information transfer X^^t and the information transfer to the gap, 
to describe an encoding of the outputs A^^f. The following lemma gives a direct relationship 
between and the entropy. A marginally simpler version of the lemma, stated with 

different notation, was first given in |15) under the absence of gaps. 
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Lemma 3. Under the assumption that the address of any cell can be specified in w bits, 
for any i £ L and t £ [n/2], the entropy 

I Ue,t = Ui,t) ^ 2'u; + 2'u; • + Ge^t \ Ui^t = uefi. 

Proof. The expected length of any encoding of under fixed Ui^t is an upper bound on 
the conditional entropy of Ai^^. We use the information transfer and the information 
transfer to the gap, to dehne an encoding of Ai^t in the following way. For every cell 
c G U Ge,t we store the address of c, which takes at most w bits under the assumption 
that a cell can hold the address of any cell in memory. We also store the contents of c 
that it holds at the very end of the first interval, just before the beginning of the gap. The 
contents of c is specified with w bits. In total this requires 2w ■ + Gi^t) bits. 

We will use the algorithm, which is fixed, and the fixed values ui^t of Ui^t as part of the 
decoder to obtain Ai^t from the encoding. Since the encoding is of variable length we also 
store the size of the information transfer and the size Gi^t of the information transfer 
to the gap. This requires at most 2w additional bits. 

In order to prove that the described encoding of A^^t is valid we now describe how to 
decode it. First we simulate the algorithm on the fixed input from the first arrival U[0] 
until just before the first interval when the first value in Ui^t arrives. We then skip over all 
inputs in Ui^t and resume simulating the algorithm from the beginning of the gap, that is 
when the value U[ti + 1] arrives. We simulate the algorithm over the arrivals in the gap 
and the second interval until all values in Ai^t have been outputted. For every cell being 
read, we check if it is contained in either the information transfer or the information 
transfer to the gap by looking up its address in the encoding. If the address is found 
then the contents of the cell is fetched from the encoding. If not, its contents is available 
from simulating the algorithm on the fixed inputs Ui^t- D 

3.2 Lower bounds on entropy 

Lemma above provides a direct way to obtain a lower bound on the expected value of 
Ii^t + Gi^t if given a lower bound on the conditional entropy H[Ai^t \ In the 

next two lemmas we provide such entropy lower bounds for L 2 -rearrangement distance and 
convolution. 

Lemma 4. Assuming Conjecture^ for the convolution problem there exists a real constant 
K > 0 and, for any n, a fixed array F £ {0,1}” such that for all I £ L and t £ [n/2], when 
U is chosen uniformly at random from {0,1}” then 

H{Ai^t\Ui^t = ue^t) ^ n-i-lgn, 


for any fixed ui^t ■ 
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Lemma 5. For the L 2 -rearrangement distance problem there exists a real constant k > 0 
and, for any n, a fixed array F G {0, !}”■ such that for all i G L and all t G [n/2] such that 
t mod 4 = 0, when U is chosen uniformly at random from {0101,1010} 4 then 

H{Ae^t\Ui^t = ue^t) ^ n-i-lgn, 


for any fixed ui^t ■ 

The proof of Lemmais deferred to Section]^ and hinges on a conjecture relating to 
Toeplitz matrices. The proof of Lemma is deferred to Section 

Before we proceed with the lower bound on the information transfer we make a short 
remark on the bounds that these lemmas give. Observe that the maximum conditional 
entropy of is bounded by the entropy of which is 0{ilgi) since the length of the 
first interval is ilgi. Recall also that the values in L range from to Thus, for a 
constant k, both entropy lower bounds are tight up to a multiplicative constant factor. 


3.3 A lower bound on the information transfer and qnick gaps 


In this section we prove our main lower bound results. In order to fix ideas we do this for 
the L 2 -rearrangement problem, given in Theorem As a result we assume that n is the 
constant and F is the hxed array of Lemma and that U is chosen uniformly at random 
from {0,1}”. Theorem which gives our main lower bound result for online convolution 
follows via exactly the same argument but with Lemma replaced by Lemma 

By combining the upper and lower bounds on the conditional entropy from Lemmas 
and[^we have that there is a hard distribution and a real constant «: > 0 for the convolution 
problem such that 

+ Ge,t\Ue,t = uefi ^ _ i 


for any U£^f We may remove the conditioning by taking expectation over Ui^t under random 
U. Thus, 


E[l£^t + G£fi 


K ■ £ - Ign 


or equivalently 


E[l£,t] ^ 


K ■ £ - Ign 
2w 


l-E[G£,t]. 


( 1 ) 


Recall that our goal is to lower bound 


E 


EE''.. 

i£L teTi 


EE^^fc.!' 

teL teTi 


where is contains suitable values of t. Using Inequality Q above would immediately 
provide such a lower bound, however, there is an imminent risk that the negative terms of 
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could devalue such a bound into something trivially small. Now, for this to happen, 
the algorithm must perform sufficiently many cell probes in the gap. Since the length of 
the gap is considerably shorter than the second interval, a cap on the worst-case number 
of cell probes per arriving value would certainly ensure that stays small, but as we 

want an amortised lower bound we need something more refined. The answer lies in how 
we define T^. We discuss this next. 

For i € L and / G \i\ we first define the set C [n] of arrivals to be 




Ti '1 

i G {0,1, 2 ,... } and f + ii ^ - j ■ 


The values in Tij are evenly spread out with distance i, starting at /. We may think of / 
as the offset of the sequence of values in T^j. The largest value in the set is no more than 
n/2. We will define the set to equal a subset of one of the sets T^j for some /. More 
precisely, we will show that there must exist an offset / such that at least half of the values 
t G T^j have the property that the time spent in the gap associated with i and t is small 
enough to ensure that the information transfer to the gap is small. We begin with some 
definitions. 


Definition 1 (Quick gaps and sets). For any i € L and t G [n/2] we say that the gap 
associated with £ and t is quick if the expected number of cell probes during the arrivals 
in the gap is no more than 

K ■ £ - Ign 
Aw ’ 

where n is the constant from Lemma Further, for any / G [£\ we say that the set T^j is 
quick if, for at least half of all t G Tij, the gap associated with £ and t is quick. 

The next lemma says that for sufficiently fast algorithms there is always an offset / 
such that T^j is quick. 

Lemma 6. Suppose that the expected total number of cell probes over the n arrivals in U 
is less than 

K ■ n ■ Ig^ n 
32w 

Then, for any £ G L, there is an f G [.^j such that Tij is quick. 

Proof. In accordance with the lemma, suppose that the expected total number of cell probes 
over the n arrivals in U is less than Kn{lg^ n)/{32w). For contradiction, suppose that there 
is no / G [£] such that Tij is quick. We will show that the expected number of cell probes 
over the n arrivals must then be at least K,n{\^ n)/{2>2w). 

For any / G [£], let C [n] be the union of all arrivals that belong to a gap associated 
with £ and any t G Tij. Let Pf be the number of cell probes performed by the algorithm 
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over the arrivals in Rf. 
expectation 

E[Pf] ^ 

Let the set of offsets 


Thus, 

for any set Tij 
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The values in T are spread out with distance 4£/lgn, which equals the gap length. Thus, 
for any two distinct /, /' G T", the sets Rf and Rf are disjoint. We therefore have that the 
total running time over all n arrivals in U must be lower bounded by Under the 

assumption that no T^j is quick, we have that the expected total running time is at least 


E 


Ep/ 

/6.F 


/6.F 




l-UI- 


K ■ n - Ign 
8w 


Ign K ■ n - ign 
4 8w 


K ■ n - Ig^ n 
32w 


which is the contradiction we wanted. Thus, under the assumption that the running time 
over the n arrivals in U is less than Kn{lg^ n)/{32w) there must be an / G [l\ such that T^j 
is quick. □ 


We now proceed under the assumption that the expected running time over the n 
arrivals in U is less than Kn{\^ n)/{32w) . If this is not the case then we have already 
established the lower bound of Theorem [H 

Let / be a value in [i\ such that T^j is a quick set. Such an / exists due to Lemma 
We now let C T^j be the set of all t G Tij for which the gap associated with i and t is 
quick. Hence \Ti\ ^ \Tij\/2 = n/{M). Since Gif cannot be larger than the number of cell 
probes in the gap, we have by the dehnition of a quick gap that for any t € Ti, 


E [Gif] < 


K ■ i - Ign 
iw 


Using the above inequality together with Inequality Q we can finally provide a non-trivial 
lower bound on the sum of the information transfers: 

'K ■ £ - Ign 


E E !*.■ 

eeL t£Ti 
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Figure 1: Two subarrays Fi and F^/ of F. There is never an overlap between any two such 
subarrays. 


By Lemmas [T] and this lower bound is also a bound on the expected total number of cell 
probes performed by the algorithm over the n arrivals in U . By Yao’s minimax principle, 


as discussed in Section 2.2, this implies that any randomised algorithm on its worst case 
input has the same lower bound on its expected running time. The amortised time per 
arriving value is obtained by dividing the running time by n. This concludes the proofs of 
Theorem [2] and [T] . 


4 The hard distribution for convolution 

In this section we prove Lemma which says that assuming Conjecture holds, there 
exists a real constant k > 0 and a fixed array F G {0, !}”■ such that for all I" G L and 
t G [n/2], when U is chosen uniformly at random from {0, then the conditional entropy 
of the outputs is 

= > K-i-lgn, 

for any fixed ui^f We begin by discussing the fixed array F. 


4.1 The array F 


For each i € L there is a subarray of F of length ilgi + i — 1. Each such subarray, which 
we denote F^, is at distance 4£/lgn + 1 from the right-hand end of F, which is one more 
than the length of the gap associated with £. Figure [^illustrates two subarrays and F^i, 
where £,£' € L and i < £'. By the properties discussed in Section 2.3 we know that the 
length of the gap associated with I' is larger than the length of Fi plus the length of the 
gap associated with £. Hence there is no overlap between the subarrays Fi and Fi'. 

Given any of the subarrays F^ and an array Ug^ of length £ Ig £, we write Fg ® Ug to 
denote the Glength array that consists of all inner products between Ug and every substring 
of Fg. More precisely, for i G [£] the i-th component of Fg ® Ug is 


{Fg®UgM= Y, [Fg[i + j]-Ug[j\). 

je[£ igd 

As Figure shows we may think of Fg 0 Ug as the inner products of Ug and its aligned 
subarray of Fg as Ug slides along Fg. We can show that there exist subarrays Fg such 
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Figure 2: The ^-length array F) (8) Un contains the inner products of and corresponding 
subarrays of as Ui slides along F^. 

that the entropy of F( (8) is high when is drawn uniformly at random from {0, 
Observe that the entropy of Fi®Uf, is npper bounded by the entropy of Ui, which is exactly 
£lg£. The proof of the next lemma is based on a conjecture that we describe shortly. 

Lemma 7. There exists a real constant e > 0 such that for all n and i £ L there is a 
subarray F^ for which the entropy of F^ (8) U(, is at least e- iX'gi when Ui is drawn uniformly 
at random from {0, 

In order to finish the description of the array F we choose each subarray Fi such that 
the entropy of Fi (8) Ui is at least e ■ ilgi when is drawn uniformly at random from 
{0 ,where e is the constant of Lemma Any element of F that is not part of any 
of the subarrays F^ is chosen arbitrarily. This concludes the description of the array F. 

4.2 Proving the entropy lower bonnd 

We are now ready to prove Lemma that is lower bound the conditional entropy of 

Proof of Lemma Let F be the array described above and let U be drawn uniformly at 
random from {0, l}'^. Let i £ L and t £ [n/2]. Thus, conditioned on any fixed Ui^t, the 
distribution of is uniform on {0, 

Recall that arrives in the stream between arrival to and ti, after which 4^/ Ig n 
values arrive in the gap. Thus, at the beginning of the second interval, at arrival t 2 , 
is aligned with the (£ Ig £)-length suffix of the subarray Fi of F. Over the i arrivals in the 
second interval, slide along F^ similarly to Figure]^ (only in reversed direction of what 
the diagram shows). Since all values in are fixed, the outputs uniquely specify 
F(, ® Ui^t- Thus, by Lemma the conditional entropy 

H{Ai^t\Ui^t = ui^t) > e-i-lgi, ^ ^-e-lgn, 

since i ^ By setting the constant k to e/4 we have proved Lemma □ 

The last piece remaining is the proof of Lemma This proof is based on a conjecture 
which we state as follows. Recall that a Toeplitz matrix (or upside-down Hankel matrix) is 
constant on each descending diagonal from left to right. 


15 















Conjecture 1. There exist two positive real constants a ^ 1 and 7^1 such that for any 
h there is a (0, l)-Toeplitz matrix M of height h and width a ■ high with the property that 
the entropy of the product Mv is at least 'y ■ high, where v is a column vector of length 
a - high whose elements are chosen independently and uniformly at random from { 0 , 1 }. 

This conjecture might at first seem surprising as the matrix is non-square. However, an 
essentially equivalent statement was shown to be true for general (0, l)-matrices in 1974 
by Erdos and Spencer [7]. Moreover, even stronger statements for general (0, l)-matrices 
form the basis of the well studied “Coin Weighing Problem with a Spring Scale” (see [3] 
and references therein). We will now show how to prove Lemma [fusing the conjecture. 

Proof of Lemma^ (assuming Conjecture^ . Let a and 7 be the two constants in Conjec¬ 
ture!^ Let h = i and let M be a (0, l)-Toeplitz matrix of height £ and width a - llgl with 
the property of Conjecture [T} 

We now dehne a new matrix of height I and width Ig such that the submatrix of 
Ml that spans the first a ■ llgi columns equals M. Remaining elements of Mi are filled in 
arbitrarily from the set {0,1} so that Mi becomes Toeplitz. 

Let vi be a random column vector of length Ig such that the first a ■ ilgi elements 
are chosen independently and uniformly from {0,1}. The remaining elements are fixed 
arbitrarily. By Conjecture we have that the entropy of Mivi is at least j ■ £lg£. Thus, 
if we instead pick all elements of vi independently and uniformly at random from { 0 , 1 } 
then the conditional entropy of Mivi, conditioned on all but the first a ■ (.IgU. elements, is 
at least 7 • £ Ig £. Hence the entropy of Mivi is also at least 7 • £ Ig £. 

Now, since Mi is Toeplitz we have that the first column and the first row of Mi define 
the entire matrix. These ^ + (!.lg^ — l elements can be represented with an array Fi of length 
l + ilgl—1 such that Fi®vi = Mivi, where we abuse notation slightly. In other words, the 
elements of the product Mivi correspond exactly to the inner products obtained by sliding 
Ui, where Ui is the array version of the column vector vi, along Fi. Thus, the entropy of 
Fi ( 8 > Ui is at least 7 • I" Ig We may therefore set the constant e in the statement of the 
lemma to equal the constant 7 of Conjecture This concludes the proof of Lemma □ 


5 The hard distribution for L 2 -rearrangement 

In this section we prove Lemma which says that there exists a real constant k > 0 and 
a fixed array F G {0,1}" such that for all i € L and all t G [n./4] such that t mod 4 = 0, 
when U is chosen uniformly at random from {0101,1010} 4 then the conditional entropy 
of the outputs Ai^t is H{Ai^t \ Ui^t = ^i,t) ^ k • £ • Ign for any fixed ui^f We begin by 
discussing the fixed array F. 
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5.1 The array F 

As for the convolution problem, for each i € L there is a subarray of F of length £ Ig £ + £. 
Each such subarray, which we denote E), is at distance 4£/lgn + 1 from the right-hand end 
of F. This is the same high-level structure as for the convolution problem and so again. 


is accurate here too. 

Given any of the subarrays F^ and an array Ui of length {i Ig i ), we write E) © to 
denote the (.^/4)-length array that consists of the L 2 -rearrangement distances between Ui 
and every fourth (£lg£)-length substring of E^. More precisely, for 4i G [i], the value of 
Fi 0 Ui[i] is the E 2 -rearrangement distance between F^lAi, Ai + — 1] and U^. 

Our main focus in this section is on proving Lemma which can be seen as an analogue 
of Lemma for a fixed length of t. 

Lemma 8. There exists a real constant e > 0 such that for all n and i £ L there is a 
subarray F^ for which the entropy of F^ 0 is at least e ■ ilgi when Ui is drawn uniformly 
at random from {0101, lOlO}^ Fi contains an equal number of Os and Is. 

In order to finish the description of the array F we choose each subarray E^ in accordance 
with Lemma Any region of F that is not part of any of the subarrays E) is filled with 
repeats of ‘01’. This ensures that these regions contain an equal number of zeros and ones 
(it is easily verified that each region has an even length). This concludes the description of 
the array F. 

The proof of Lemma then follows follows from Lemma It is conceptually similar 
to the proof of Lemma for convolution which follows from Lemma However, for 
the convolution problem, our proof relied on the fact that we could essentially consider 
the convolution of each pair Fi and separately and simply add them up to find the 
convolution of F and U. This is less immediate for L 2 -rearrangement distance because 
we need to rule out the possibility of characters from some Ui being moved to positions 
in E)/ for i ^ i. The proof (and the lower bound in general) relies on a key property 
of L 2 -arrangement (proven in Lemma 3.1 from [T]) which states that under the optimal 
rearrangement permutation, the f-th one (resp. zero) in one string is moved to the z-th one 
(resp. zero) in the other. By controlling how the zeroes and ones are distributed in U and 
F, we can limit how far any character is moved. 

We are now ready to prove Lemma that is lower bound the conditional entropy of 
^e,t- 

Proof of Lemma Let F be the array described above and let U be drawn uniformly at 
random from {0101, 1010 } 4 . Let i G L and t G [n/2\. Thus, conditioned on any fixed Ui^f, 
the distribution of Ui^t is uniform on { 0101 , 1010 } 

Recall that Ui^t arrives in the stream between arrival to and ti, after which 4£/lgn 
values arrive in the gap. Thus, at the beginning of the second interval, at arrival t 2 , Ui^t 


there is no overlap between the subarrays E} and E}/ and further. Figure [Bin Section 2.3 
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is aligned with the (£ Ig t')-length suffix of the subarray Fi of F. Over the i arrivals in the 
second interval, U£^t slides along Fi similarly to Figure We now prove that since all 
values in are fixed, the outputs uniquely specify F^ 0 The analogous property 
for convolution was immediate. First observe that by construction the prefix of F up to 
the start of Fi contains an equal number of Os and Is. Similarly for Fi itself and the suffix 
from Ffi to the end of F. Once in every four arrivals, the substring of U aligned with F is 
guaranteed (by construction) to also have an equal number of Os and Is. Therefore the L 2 
rearrangement distance is finite. It was proven in Lemma 3.1 from [T] that (rephrased in 
our notation) under the optimal rearrangement permutation, the fc-th one (resp. zero) in F 
is moved to the A:-th one (resp. zero) in U. Therefore, every element of is moved to an 
element in Fg. We can therefore recover any output in FgQUi^t by taking the corresponding 
output in and subtracting, the costs of moving the elements that are in U but not in 
U£. It is easily verified that as t is divisible by four, the corresponding output in A£^t is 
one of those guaranteed to have an equal number of Os and Is. Thus, by Lemma the 
conditional entropy 

H{A£^t\U£^t = U£^t) > e-i-lgi, ^ ^-^-Ign, 
since i ^ By setting the constant k to e/4 we have proved Lemma □ 

5.2 The proof of Lemma 

In this section we prove Lemma We begin by explaining the high-level approach which 
will make one final composition of both F/ and U£ into sub-arrays. For any j ^ 0, let 
[// = Uili ■ j,i ■ (j -|- 1) — 1] i.e. [// is the j-th consecutive ^-length sub-array of Ui. The 
key property that we will prove in this section is given in Lemma which intuitively states 
that given half of the bits in Ui, we can compute the other half with certainty. 

Lemma 9. Let U£ be chosen arbitrarily from {0101,1010}4 . Given Fi, F£QU£ and 
for all j ^ 0, it is possible to uniquely determine for all j ^ 0. 

Before we prove Lemma we briefly justify why Lemma [8| is in-fact a straight-forward 

0 l_r LJ ^ 

If we pick U£ uniformly at random from {0101,1010}4 then by 
LemmalOl the conditional entropy, ff(F£ 0 [/£ | for all j) is n(£lg£). This is because 

we always recover 0(lg£) distinct , each of which is independent and has entropy 
bits. It then immediately follows that H{F£QU£) ^ H{F£QU£ \ for all j) as required. 

We also require for Lemmathat Fi contains an equal number ones and zeros. This follows 
immediately from the description of F£ below. 

The subarray F£ We now give the description of F£ which requires one final decompo¬ 
sition into subarrays. This description is supported by Figure]^ For each j G [[(lg£)/2j], 
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Figure 3: The high-level structure of Ug and Fi. 


Fi contains a subarray F^ of length i. Intuitively, each sub-array F^ will be responsible 
for recovering U'jF These subarrays occur in order in F). Before and after each F^ there 
are stretches of repeats of the string 1001. Specihcally, before F^ there are £/4 — 1 repeats 
the string 1001. Between each Fj and there are repeats of the string 1001 and 
after 1/4 + 1 repeats. These repeats of 1001 are simply for structural 

padding and as we will see the contribution of these repeated 1001 strings to the L 2 rear¬ 
rangement distance is independent of Ui. This follows because the cost of rearranging 1001 
to match either 1010 or 0101 is always 2. 

The structure of F/ is as follows F/ = Here 0^ (resp. 1^) 

is a string of z zeros (resp. ones). Intuitively, the reason that the stretch of Os at the 
start of F/ is the exponentially increasing with j is so that distance the second one in F/ 
(immediately after the stretch of Os) is forced to move also exponentially increasing with j. 
This is will allow us to recover each from a different bit in the outputs. 


Proof of Lemma We are now in a position to prove Lemma Our main focus will 
be on first proving that given F), for all j and F^ 0 Ui, we can uniquely determine 


— 4, i — 1] for each j ^ 0. That is, for each j whether the last four symbols of 
are 0101 or 1010. This is shown diagrammatically in Figure dl We argue that by a 


straight-forward repeated application of this argument we can in-fact recover the whole of 
for all j ^ 0. 
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(xxxxx denotes a repeated stretch of 1001) 


unknown 

known 


Figure d: We can determine U^^[i — d,£ — 1] if we know F^, every and F( 0 U^. 


We will begin by making some simplifying observations about (F^ 0 U£)[0]. Recall that 
{Fi 0 t/f)[0] was dehned to be the F 2 rearrangement distance between ^^[0, \Ui\ — 1] and 
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U(^. The first observation is that is finite because both strings contain an equal number of 
zeros and ones. 

The L 2 rearrangement distance {FiQU()[Q] can be expressed as the sum of the contribu¬ 
tions from moving each over all i G [m]. Let the contribution of U(\i\, denoted, CT(z) 
be the square of the distance that Ut\i\ is moved under the optimal rearrangement. We 
then have that (T^Q U()[Q] = Yhi CT(j). Finally, we let D* be the sum of the contributions 
of the locations in every — A, I —1], i.e. D* = Yk=o (CT(2j •£ + (£ — 4) + A;). We 
will also refer to the contribution of a substring which is defined naturally to be the sum of 
the contributions of its constituent characters. For example the contribution of equals 
E{CT(r)|rG [A-j,£-(j + l)-l]}. 


Our proof will be in two stages. First we will prove in Lemma 10 that we can compute 
D* from F^, 0 and for all j ^ 0. Second we will prove that for any j > 0, we 

can determine — 4, £ — 1] from D*. 

In the proof of Lemma 10 we argue that D* can be calculated directly from (Fg 0 Ui) [0] 
by subtracting the contributions of and — 5] for all j ^ 0. More specifically, 

we will prove that the contribution of any can calculated from and F^, which 

are both known. In particular, the contribution of any is independent of every 

unknown Further, we will prove that although is unknown, the contribution of 
— 5], always equals £/2 — 2, regardless of the choice of Uf,. 

Lemma 10. D* can he computed from Fi, F^ 0 Ug and for all j ^ 0. 


Proof The value of D* is calculated directly from [Fi 0 Ui) [0] by subtracting the contri¬ 
butions of and Uf^[0,1 — b] for all j ^ 0. We will now prove that for any j, we can 

calculate the contribution of and that contribution of — 5], in-fact always 

equals .^/2 — 2, regardless of the choice of Ui. 

In this proof we rely heavily on Lemma 3.1 from [T] which states that under the optimal 
rearrangement permutation, the i-th one (resp. zero) in Ui is moved to the i-th one (resp. 
zero) in F>[0, - I]. ^ ^ ^ ^ ^ ^ 

For any j, consider, u'^^ and . The number of ones in uj^ (resp. is fixed, 

independent of the choice of Ui. In particular there are exactly £/2 zeroes and £/2 ones. 
It is easily verified that, by construction, Fi\flj ■ (2j -|- 2) • I' — I] also contains exactly I 

zeros and £ ones. Therefore, the i-th one (resp. zero) in u’^^ is moved to the i-th one (resp. 
zero) in Fi[2j ■ A, (2j -|- 2) • I' — I]. Similarly, the i-th one (resp. zero) in U^^ is moved to the 
(i -|- £/2)-th one (resp. zero) in Fi[2j ■ £, (2j -|- 2) • — I] 

Consider any which is known. By the above observation, we can therefore 

determine which position in Fi[2j ■ £, (2j -|- 2) • £ — I], each character in is moved to 

under the optimal rearrangement. From this we can then directly compute the contribution 
of each to {Fi 0 Ui)[{)\. 

Consider any u'j^ which is unknown. As observed above, the i-th one (resp. zero) in 
Uf^ is moved to the i-th one (resp. zero) in Fi\2j ■ {2j -|- 2) • A — I]. By construction, we 
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have that Fi[2j ■ (2j + 1) • £ — 5] consists entirely of repeats of 1001. Further for any i, 

we have that + 3] is either 1010 or 0101. Therefore for all i < ijA: we have that 

the two ones (resp. zeroes) in + 3] are moved to the two ones (resp. zeroes) in 

F(\^j • £ + 4i, 2j • .^ + 4z + 3] = 1001. The key observation is that regardless of whether 
4z + 3] = 1010 or 0101, the contribution of U‘^^[Ai, 4i + 3] is 2. Therefore for any U^, 
the contribution of I7^^l0,£ — 5] is ll2 — 2. □ 

In Lemma 11 we will prove that we can compute U‘^^[£ — 4,.^ — 1] from D* (for any 
sufficiently large j). The intuition behind this is given by Fact which gives an explicit 
formula for the contribution of [£ — A, £ — 1]. Observe that the contribution depends 
only on whether U‘j^[£ — 4,^—1] equals 1010 {vj = 1) or 0101 {vj = 0). The intuition is 
that we can extract Vj from the (j + l)-th bit of D*. 

Fact 1. For any j, let Vj = 1 if U‘^^[£ — A, £ — 1] = 1010 and Vj = 0 otherwise. The 
contribution of U^^[£ — A, £ — 1] is exactly Vj ■ 2-^+^ + 2^-^ + 2. 


Proof. We begin by arguing that under the optimal rearrangement permutation, the two 
ones (resp. zeroes) in U‘^^[£ — 4,£ — 1 ] are moved to the leftmost two ones (resp. zeroes) in 
F^. We will again rely heavily on Lemma 3.1 from [T] which states that under the optimal 
rearrangement permutation, the i-th one (resp. zero) in Ui is moved to the z-th one (resp. 
zero) in ^^[ 0 , |C 4 | - 1 ]. 


In the proof of Lemma 10, we argued that the z-th one (resp. zero) in is moved to the 


z-th one (resp. zero) in Fi[2j ■£, {2j+ 2) ■£—!]. It is easily verified that Fi[2j ■£, {2 j+ 2)-i — l] 
consists of exactly £lA— \ repeats of 1001 followed by Therefore as U^^[0,£ — 5] contains 
exactly £/2 — 2 ones (resp. zeroes), the two ones in U'^^[£ — 4,£ — 1] are indeed moved to 
the leftmost two ones (resp. zeroes) in FL 


c/f 

Dl^ 




xxxxxxxxxxxxxx 

lob o---oTi 1 1 1 1 1---1 

0"0 

X X 


+ 3H 


Figure 5: The rearrangement of the symbols in u‘^^[£ — A, £ — 1] under the optimal 
rearrangement permutation. The highlighted region is Fj. 

We now argue about the contribution of each character in [£ — A,£—V\ in turn. This 
argument is is supported by Figure 

First consider the first one in [£ — 4, £ — 1] which is moved to the first one in F^. By 
construction we have that, when indexed from the start of F), the leftmost one in Fj is at 
position X = {2j + 1) • ^ — 4. Similarly, when indexed from the start of U^, the leftmost one 


21 










in — 1] is at position x + (1 — Vj). Therefore the contribution of the first one in 

is exactly (1 — 

The first zero in — 4,£ — 1] is moved to the first zero in F^. The first zero in 

— 4, £ — 1] is at position x + vj. The first zero in is at position x + 1. Therefore 
the contribution of the first zero in is also exactly (1 — Xj)^. 

The second zero in U‘^^[£ — 4, ^ — 1] is moved to the second zero in F^. The second zero 
in U^^[i — 4,£ — 1 ] is at position x + 2 + Vj and the second zero in Fj is at position x + 2 . 
Therefore the contribution of the first zero in is exactly 

Finally, the second one in U^^[£ — 4,£ — 1 ] is moved to the second one in F^. Again, by 
construction, we have that the second one in Fj is at position x + 3 + 2-^ (indexed from 
the start of F^). Similarly, the second one in Uf^[£ — 4,^ — 1] is at position x + 3 — Vj. 
Therefore the contribution of the second one in is exactly, (2-^ F vj)^. Summing over 
all four characters we have that the total contribution of U^^[i — 4,£ — 1] is: 

(2^ + Vj)^ + '^] + 2(1 - 

Expanding and simplifying we have that this is 


(2^1 + 2) + Vj ■ 2l+^ + 4v] - Avj 

As Uj G {0,1} we have that vj = Vj so this simplifies further to: 

Vj ■ 2 ^+^ + 2^1 + 2 


□ 


We can now prove Lemma 11 which follows almost immediately from Fact 
Lemma 11. For any j ^ 0, it is possible to compute Uj^[£ — 4,f — 1] from D*. 


Proof. Let D 2 equal D* — + 2) which can be calculated directly from D*. An 

alternative and equivalent definition of Dif follows from Fact and is given by 


D 


★ 

2 




We can therefore compute Vj and hence Uj^[£ 
bit of D 2 . 


4,t' — 1] by inspecting the (j + l)-th 

□ 
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Recovering the rest of C4,(2j) So far we have only proven that we can recover — 

4, £— 1] for all j. The claim that we can in-fact recover the whole of follows by repeatedly 
application of the the proof above. Specifically, once we have recovered U^^ [£ — 4, £ — 1] 
for all j, we can use this additional information (and 0 17£)[1] instead of 0 ?7f)[0]) 
to recover [£ — 8, f — 5] for all j and so on. More formally we proceed by induction on 
increasing k by observing that using the above argument given F^, (F^ 0 U()[k]^ 
all j ^ 0 and [£ — 4k, £ — 1] for all j ^ 0 we can recover [£ — 4k — 4,£ — 4k — 1] 
for all j. 
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